Optimization of N-Queens Solvers on Graphics Processors
نویسندگان
چکیده
While graphics processing units (GPUs) show high performance for problems with regular structures, they do not perform well for irregular tasks due to the mismatches between irregular problem structures and SIMD-like GPU architectures. In this paper, we explore software approaches for improving the performance of irregular parallel computation on graphics processors. We propose general approaches that can eliminate the branch divergence and allow runtime load balancing. We evaluate the optimization rules and approaches with the n-queens problem benchmark. The experimental results show that the proposed approaches can substantially improve the performance of irregular computation on GPUs. These general approaches could be easily applied to many other irregular problems to improve their performance.
منابع مشابه
PB-smodels a Pseudo-Boolean Solver
A pseudo-boolean constraint (PB-constraint) is a linear inequality with 0-1 variables and integer coefficients. Search problems can be modeled as sets of PB-constraints, so that, solutions to a set of PB-constraints map to solutions of the search problem. An optimization problem can be modeled using a set of PB-constraints along with an optimization statement over a set of 0-1 variables. Pseudo...
متن کاملImplementation of an Efficient Conjugate Gradient Algorithm for Poisson Solutions on Graphics Processors
Programmable graphics processors have achieved the distinction of being very efficient and cost-effective in terms of floating-point capacity, thereby making it an attractive option for scientific computing. In this paper, we discuss the implementation of the Conjugate Gradient iterative solver on a graphics processor. A Poisson equation is solved with the graphics processor on an unstructured ...
متن کاملGpu Computing with Msc Nastran 2013
Current trends in HPC (High Performance Computing) are moving towards the use of many core processor architectures in order to achieve speed-up through the extraction of a high degree of fine-grained parallelism from the applications. This hybrid computing trend is led by GPUs (Graphics Processing Units), which have been developed exclusively for computational tasks as massively-parallel co-pro...
متن کاملAccelerating Double Precision FEM Simulations with GPUs
In visualization and computer graphics it has been shown that the numerical solution of PDE problems can be obtained much faster on graphics processors (GPUs) than on CPUs. However, GPUs are restricted to single precision floating point arithmetics which is insufficient for most technical scientific computations. Since we do not expect double precision support natively in graphics hardware in t...
متن کاملComplexity of n-Queens Completion
The n-Queens problem is to place n chess queens on an n by n chessboard so that no two queens are on the same row, column or diagonal. The n-Queens Completion problem is a variant, dating to 1850, in which some queens are already placed and the solver is asked to place the rest, if possible. We show that n-Queens Completion is both NP-Complete and #P-Complete. A corollary is that any non-attack...
متن کامل